Categories

Versions

Filter Tokens (by Region) (Text Processing)

Synopsis

Filters tokens based on the region around another token.

Description

This operator keeps only tokens in a region of a specified token. Overlapping regions are kept as a whole, i.e. if the token occurs several times, the maximum regions around all tokens are built and the union of those regions is delivered.

Input

  • document

    The document port.

Output

  • document

    The document port.

Parameters

  • conditionThe condition a document must fulfill to be kept. Range:
  • stringThe string that should be compared to. Range:
  • regular_expressionThe regular expression for that should match. Range:
  • case_sensitiveSpecifies whether the comparison should be case-sensitive. Range:
  • invert conditionSpecifies whether comparison outcome should be inverted. Range:
  • tokens_beforeThe maximum number of tokens kept before the specified token. Range:
  • tokens_afterThe maximum number of tokens kept after the specified token. Range: